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Abstract. We study a continuous-time dynamical system that models the 
evolving distribution of genotypes in an infinite population where genomes 
may have infinitely many or even a continuum of loci, mutations accumulate 
along lineages without back-mutation, added mutations reduce fitness, and re- 
combination occurs on a faster time scale than mutation and selection. Some 
features of the model, such as existence and uniqueness of solutions and con- 
vergence to the dynamical system of an approximating sequence of discrete 
time models, were presented in earlier work by Evans, Stcinsaltz, and Wachter 
for quite general selective costs. Here we study a special case where the selec- 
tive cost of a genotype with a given accumulation of ancestral mutations from 
a wild type ancestor is a sum of costs attributable to each individual muta- 
tion plus successive interaction contributions from each fc-tuple of mutations 
for k up to some finite "degree" . Using ideas from complex chemical reaction 
networks and a novel Lyapunov function, we establish that the phenomenon 
of mutation-selection balance occurs for such selection costs under mild con- 
ditions. That is, we show that the dynamical system has a unique equilibrium 
and that it converges to this equilibrium from all initial conditions. 



1. Introduction 

Many phenotypic traits, including some genetic disorders, are thought to be 
polygenic and result from the possibly complex epistatic (that is, non-additive) 
interactions between large numbers of mildly deleterious alleles that are slowly 
weeded out of the population by natural selection but are constantly reintroduced 
by recurrent mutation. In particular, the Medawar- Williams-Hamilton |Med52[ 
Wil57, Ham66 explanation of the evolution of aging (see, also, [Cha94, ChaOlJ and 
the introductory discussions in JSEW05I IESW061 IWES08L IWSE08Q invokes this 
mechanism of mutation-selection balance. 

It has been a challenge to provide a sufficiently usable quantitative description 
of the mutation-selection phenomenon when many genes are involved (see, for ex- 
ample, [Hol95] or FKOOji in particular, Section 5.3.2 of the latter is entitled "Need 
for a mathematical framework" and specifically lays out the necessity for mathe- 
matically tractable approaches to outcomes that are modulated by the interaction 
of large numbers of genes) . 
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Evans, Steinsaltz, and Wachter [SEW05, ESW06] describe two related mathe- 
matical settings for the mutation-selection process. Here we consider the model of 
the latter paper. The key assumptions in |ESW06| are: 

• the population is infinite, 

• the genome may consist of infinitely many or even a continuum of loci, 

• reproduction is sexual, in that each individual has two parents and the 
mechanism of genetic recombination randomly shuffles together the genomes 
of the parents in order to obtain the genome of the offspring, 

• mating is random, 

• individuals are haploid (that is, an individual has only one copy of each 
gene rather than copies from each of two parents, as is the case for diploid 
organisms) , 

• mutations accumulate along a lineage from an ancestral wild type geno- 
type (that is, there is no back- mutation canceling out the effect of earlier 
mutations) , 

• fitness is calculated for individuals rather than for mating pairs, 

• genotypes that have more accumulated mutations are less fit, but otherwise 
selection costs are arbitrary, 

• the effects of recombination are felt on a faster time scale than those of 
mutation or selection. 

For the purpose of explaining our results, we now describe briefly the general 
model of ESW06J. Our results concern a particular family of selection costs that 
we introduce later. 

We write M. for the collection of loci in the portion of the genome that is of 
interest to us. We assume that there is a distinguished reference wild type genotype, 
and each locus represents a "position" at which the genotype of an individual may 
differ from that of the wild genotype. We have placed the word "position" in quotes, 
because we do not necessarily take M. to be something like a finite collection of 
physical DNA base positions or a finite collection of genes. From a modeling point 
of view, it is convenient to allow M to be more general. For example, the proposed 
explanations in Charlesworth |Cha01| for Gompertz mortality curves and mortality 
plateaus at extreme ages suggest that one relevant choice of M. may be a class of 
functions from R + to R+: the value of such a function at time t > represents an 
additional increment to the mortality hazard rate at age t conferred by a mutation 
away from the wild type at this locus. Some minimal amount of structure on M. is 
necessary in order to accommodate probability theory, and so we take M. to be a 
complete, separable metric space. 

We suppose that the genotype of an individual is described completely by the 
number of times a mutation event has occurred at each locus along the lineage con- 
necting the individual to a wild type ancestor. Similar assumptions are standard in 
models of the infinitely many alleles, infinitely many sites, and continuum- of- alleles 
type. In particular, all potential mutations at a locus that is currently wild type are 
indistinguishable, and further mutation events cannot undo the results of earlier 
ones (that is, there is no back mutation). When the space M. is continuous, it will 
typically be the case that multiple mutations will not occur along a lineage at any 
locus. For discrete spaces M. that are large, the assumption of mutation accumula- 
tion without back-mutation is reasonable when mutation rates are sufficiently low 
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that it is unlikely that any locus will be affected by a mutation more than once 
along a lineage. 

Consequently we may think of an individual's genotype as an element of the 
space Q of integer-valued finite Borel measures on A4, where the non- negative 
integer g(A) for g £ Q and A C Ai represents the total number of mutation events 
that have occurred at loci belonging to the subset A of the genome along the lineage 
connecting the individual to a wild type ancestor. An element of Q is a finite sum 
Y]- S mi , where S m is the unit point mass at the locus m £ M, corresponding 
to a genotype where there have been ancestral mutations at loci mi,m2, .... In 
particular, the wild type genotype is the null measure. As we remarked above, we 
do not require that the loci mi £ M arc distinct. For example, if M. is finite, so we 
might as well take M. = {1, 2, . . . , N} for some positive integer N, then a genotype 
is of the form X)j=i n i^ji indicating that ancestral mutations have occurred rij 
times at locus j. We may identify such a genotype with the non- negative integer 
vector n := (ni, ri2, ■ ■ ■ , un), so that Q is essentially the Cartesian product N w of 
N copies the non-negative integers. 

We imagine that the population we are interested in is infinite and that all that 
matters about an individual is the individual's genotype, so that the dynamics of 
the population can be described in terms of the proportions of individuals with 
genotypes belonging to the various subsets of Q. We therefore seek to model the 
evolution of probability measures Pt, t > 0, on Q, where Pt(G) represents the 
proportion of individuals in the population at time t that have genotypes belonging 
to the set G C Q. Equivalently, Pt(G) can be interpreted as the probability that 
an individual chosen uniformly at random from the population will have genotype 
belonging to G, so that Pt is the distribution of a random finite integer-valued 
measure on M.. For example, if = {1,2,..., N} and we identify Q with as 
above, then P t ({n}) represents the probability that an individual chosen uniformly 
at random from the population will have rij ancestral mutations at locus j for 
./ I--' v. 

We remark that although the introduction of this measure-theoretic formalism 
might seem somewhat heavy-handed, the use of measures to represent genotypes 
and probability measures on spaces of measures to represent populations has be- 
come quite standard in mathematical population genetics. It is apparent from 

EK86 how useful this perspective has been for understanding stochastic models of 
the infinitcly-many-allclcs and infinitely-many-sites type. Similarly, the stochastic 
model of [SH92] is based on a Poisson random measure on a Cartesian product 
of the space of sites and the unit interval (with the latter coordinate represent- 
ing frequency of mutant alleles at the corresponding site). Also, the recent papers 

BB03, Baa05, Baa07 adopt a measure-theoretic framework to present a determinis- 
tic model of recombination based on BaaOlj . The discrete-time mutation-selection 
models of |Esh71j and |Kin78j describe the population in terms of a probability 
measure over the real numbers that records the proportions of individuals with 
various fitnesses. A far-reaching continuous-time generalization of these models is 
presented in |BB96j (see also |Bur00] ) where individuals have an abstract type be- 
longing to a locally compact space and the population is described by a probability 
measure on this space. 

We next indicate how mutation, selection and recombination are modeled in our 
setting to obtain the evolution dynamics for P t . 
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Mutation alone. Suppose that there is only mutation and no selection or recom- 
bination. Because no genome has a selective advantage, all individuals are dying 
and reproducing at the same rate, and since there is no recombination the geno- 
type of an offspring is simply that of the parent with possible alterations due to 
mutation. Somewhat informally, we imagine that there is finite measure v on the 
space of loci Ai such that for any individual alive at time t > the probability 
a mutation occurs in the infinitesimal region dm C Ai during the time interval 
dt is dt x i>(dm), and we also assume that the occurrences of mutations are inde- 
pendent between different individuals, different times, and different regions of Ai. 
If we write P t $ = J g <&(g)P t (dg) for some test function $ : Q — > R (that is, 
is the expected value of the real-valued random variable obtained by applying the 
function $ to the genotype of an individual chosen uniformly at random from the 
population), then the content of these assumptions is contained formally in the 
equation 

j t P t § = P t (J [$(• + 6 m ) - *(•)] v{dm) \ . 

For example, when Ai = {1, 2, . . . , N} we have 
, n 
-P t ({n}) = KO'}) [Pt({n ej}) - P t ({n})] , 

j=i 

where ej is the j th coordinate vector. We stress that this equation is classical: it 
is a special case of the usual equation describing evolution due to mutation of type 
frequencies in a population where the set of types is and mutation from type n 
to type n + ej occurs at rate v({j}) - see, for example, Section III. 1.2 of |BurOO| . 

This evolution equation has a simple explicit solution. Let II denote a Poisson 
random measure on Ai x R + with intensity measure v ® A, where A is Lebesgue 
measure; that is, LT is a random integer-valued Borel measure such that: 

(1) The random variable H(A) is Poisson with expectation v ® X(A) for any 
Borel subset A of Ai x E + . 

(2) If A\ 1 A'2. 1 . . . ,A n are disjoint Borel subsets of Ai x K + , then the random 
variables n(^4fc) are independent. 

Define a ^-valued random variable Z t (that is, Z t is a random finite integer- valued 
measure on M) by Z t := J Mx r t i S m H(d(m, u)). Then P t $ = E [$(W + Z t )}, where 
W is a random measure on Ai that is independent of LT and W has distribution 
Pq. In particular, if Pq is the distribution of a Poisson random measure, then P t 
will also be the distribution of Poisson random measure. If we write pt for the 
intensity measure associated with Pt (that is, pt is the measure on Ai defined by 
pt(M) := Jg g(M) Pt(dg) for M C Ai), then pt evolves according to the simple 
dynamics p t (M) = p Q (M) +tv{M). 

Selection alone. Now consider what happens if there is only selection and no 
mutation or recombination. We assume that there is a selection cost function S : 
Q — * R+ with the interpretation that S(g') — S(g") for g',g" € Q represents the 
difference in the rate of population size increase between individuals with genotype 
g" and individuals with genotype g' . We make the normalizing assumption S(0) = 
and suppose that 



(1.1) 



S(g + h)>S(h), g,heG, 
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so that if the ancestral mutations from wild type in one genotype are a subset 
(taking into account multiplicity) of those in a second genotype, then the individuals 
with the second genotype have a lesser rate of increase than those with the first 
genotype. Heuristically, we have 



dt^ 



du J exp(-S(g")u)P t (dg") 



(PtS - S(g')) P t (dg'); 



That is, at time t > the per individual rate of increase of the proportion of the 
population of individuals with genotype g' is PtS — S(g r ). More formally, we have 



d_ 

dt 



Pt® — - Pt(® ■ [S - PtS]) 



S(g') 



S(g")P t (dg") 



Pt(dg'). 



For example, when M. = {1, 2, . . . , N} we have 



dt P ^ 



S(n')-$>t({n"})S(n") 



Pt({n'}). 



We again stress that this equation is classical: it is a special case of the usual 
equation describing evolution due to selection of type frequencies in a population 
where the set of types is - see, for example, Section III. 1.2 of [BurOOJ. 

If S is non-epistatic (that is, S has the additive property 5 l (J^ i 5 m< ) = 
S(5 mi )), then there is no interaction between the selective effects of ancestral 
mutations at different loci or multiple ancestral mutations at a single locus: in 
particular, if Pq is the distribution of a Poisson random measure then P t will also 
be the distribution of a Poisson random measure and, writing p t for the intensity 
measure associated with P t as before, we have 



Pt(dm') = p (dm') - 



S{6 m >) S(5 m „)p s {dm") 



M 



p s (dm') ds. 



However, if S is epistatic (that is, non-additive), then Pt will, in general, not be 
the distribution of a Poisson random measure - even when Pq is. 



Recombination alone. Lastly, we discuss the incorporation of recombination. 
Our description will be brief, as the details will not be important to us, because 
we are interested in an asymptotic regime where recombination occurs on a faster 
time-scale than mutation or selection, and in the limit the detailed features of the 
recombination mechanism disappear. The effect of recombination is to choose an 
individual uniformly at random from the population at some rate and replace the 
individual's genotype g' with a genotype of the form g'(- n M) + g"{- fl M c ), where 
g" is the genotype of another randomly chosen individual, M is a subset of M. 
chosen according to a suitable random mechanism, and M c is the complement of 
M. Thus, recombination randomly shuffles together two different genotypes drawn 
from the population. If recombination alone is acting, P has the property that 
there does not exist an m S M. with Po({g € G '■ g{{m}) > 0}), and the mechanism 
for choosing the so-called segregating set M is such that, loosely speaking, if m! and 
to" are two loci then there is positive probability that to' G M and to" G M c , so 
that no region of Ai is immune from shuffling, then Pt will converge to a Poisson 
random measure with the same intensity measure as Pq as t increases - see [ESW06 
for details. 
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Combining mutation, selection and recombination. We have seen that if 
Pq is the distribution of a Poisson random measure, then mutation preserves this 
property, while (epistatic = non-additive) selection and recombination respectively 
drive the system away from and toward Poisson. If all three processes are operating 
and we consider a limiting regime where recombination acts on a faster time scale 
than mutation and selection, then we expect that the resulting system will preserve 
the Poisson property. Results to this effect are established in | ESW06 : more pre- 
cisely, it is shown there that if one considers a discrete generation evolution in which 
mutation and selection are modeled by discrete-time analogues of the differential 
equations described above, recombination is modeled as above, and time is scaled 
so that the time between generations converges to zero, then the discrete-time 
evolution converges to a continuous-time evolution that preserves Poisson initial 
conditions, provided that asymptotically recombination acts on a faster time scale 
than mutation and selection. Moreover, the explicit details of the recombination 
mechanism in the discrete generation evolutions have no influence on the limit: 
all that matters is that the relative effect of recombination becomes stronger and 
stronger in the limit, and, as described above, that no region of At is immune from 
the shuffling effect of recombination. We refer the reader to [ESW06] for rigorous 
statements and proofs; however, we do stress that the analysis of [ESW06 does 
not start from an a priori assumption that loci are unlinked, rather it delineates 
how strong the Poissonizing effect of recombination has to be in order to overcome 
the tendency of non-epistatic selection to induce linkage between loci. 

The limiting evolution is thus a family of probability measures Pt on Q that are 
Poisson with intensity measure p t . If we write X™ for a Poisson random measure on 
Ai with intensity measure 7r, then, as we expect from combining the observations 
above, pt satisfies the evolution equation 

(1.2) p t (dm) = p (dm) +tv(dm) - [ E[S(X P ° + 5 rn ) - S{X P ')] p s (dm) ds. 

Jo 

For example, when At = {1, 2, ... , N} we have 

jMm = K{i» - pt({j}) E + e i) - II exp(-MW)) Wi n ! ■ 

n k=l k ' 

Note that although there are infinitely many "types" in this latter finitely-many- 
loci special case, the evolution of the population is described by a system of N 
differential equations for the N real quantities Pt({j}), 1 < j < N. 

In this paper we consider selection costs S that are, in a suitable sense, polyno- 
mial. Roughly speaking, this means that, for some finite positive integer N, the 
selection cost of a given genotype (that is, a given collection of accumulated muta- 
tions from the ancestral wild type) is a sum of non-negative contributions from each 
individual mutation, plus a sum of non-negative contributions due to interactions 
between the mutations in every pair of mutations, and so on - all they way up to 
a sum of non-negative contributions due to interactions between the mutations in 
every iV-tuple of mutations. 

We show for such selection costs that, under very mild conditions, the system 
(|1.2|) has a unique equilibrium and that the system converges to this equilibrium 
from any initial condition pg that is absolutely continuous with respect to v. 
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Because denning precisely what we mean by a polynomial selection cost and 
describing our results for a general locus space M. requires a certain amount of 
extra notation, for the sake of this introduction we just describe our results in the 
special case when Ai is the finite set {1, N}, and leave the general case to the 
body of the paper. 

In the finitely-many-loci case, the mutation rate measure v is defined by its value 
on singleton sets and we write Vi := v({i\). We assume that v$ > for all i, and 
let v also denote the vector (yi, . . . , vn)- As we remarked above, a genotype g G Q 
can be encoded by an ordered N~ tuple of non-negative integers g = (g\,... , g_/v), 
where gk represents the number of ancestral mutations that are present at locus k. 
A polynomial selection cost is one of the form 

(1.3) S(ff)=$>^> 

where the sum is taken over all nonempty subsets / C {1, . . . , N} and we adopt 
the convention that for a vector v the notation v 1 denotes the product His/ 1 "'- 
The constants au\ for 1 < i < N measure the selective cost of a mutation at 
locus i alone, whereas the constants ai for subsets / C {1, . . . , N} of cardinality 
greater than one measure the selective cost attributable to interactions between 
the mutations at loci in / over and above that attributable to interactions between 
mutations in sets of loci that are proper subsets of /. We assume that aj > 
for all /. This assumption ensures that the monotonicity condition holds. It 
is, of course, not a necessary condition for (jl.ip to hold, but it will be crucial in 
our analysis. Indeed, it will be shown in a forthcoming paper by Evans, Steinsaltz 
and Wachter that there are some natural infinite dimensional systems that arise 
in applications to aging where the analogue of (|1.3|) fails to hold and the resulting 
system does not converge to an equilibrium. 

Write ...,pjv(i)) for the vector corresponding to the intensity measure of 

P t . As we show in Section[2l the evolution equation (II. 2|) is equivalent to the system 
of ordinary differential equations 

(1.4) p k =v k -J2 onp 1 , l<k<N, 

i elk 

where Ik denotes the collection of subsets of {1, ... , N} that contain k. The fol- 
lowing is an immediate corollary of our main result for general locus spaces M, 
Theorem 13.11 

Corollary 1.1. Suppose that a^y > for 1 < i < N. The system |J.4| ) has a 
unique equilibrium point in the positive orthant Mrf, and this equilibrium is globally 
stable (that is, the system converges to the equilibrium from any initial conditions 
in 

Note that, in this finitely-many-loci case, the problem of determining the equi- 
librium point reduces to finding the unique zero in the positive orthant of a set of 
N equations in N variables, and there is an extensive body of theory in numerical 
analysis and computational commutative algebra devoted to this problem. 

We finish this introduction with a very brief indication of the substantial litera- 
ture on multilocus deterministic models in population genetics and the particular 
import of such models for the phenomenon of mutation-selection balance. 
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The standard mathematical reference on population genetics models in general 
(both deterministic and stochastic) is |Ewe 04j by Warren Ewens. This is a revision 
of a book from 1979 and the treatment of deterministic models is not expanded sig- 
nificantly from the original. In [ET77 , Ewens and Glenys Thomson also made one 
of the earliest contributions to understanding equilibria of deterministic multilocus 
genetic systems in which no special assumptions are made about fitness structure. 
While mutation only occurs from the wild type to the derived type in our model, 
the models in [ET77] (in common with many other models in the literature) allow 
several alleles at a locus and mutation from any allelic type to any other. 

A very comprehensive recent reference that concentrates on the infinite- 
population, deterministic aspects of population genetics is Reinhard Burger's book 
[BiirOO] (see also the Burger's review paper [Biir98j ). In particular, these works 
consider at length deterministic haploid continuum- of- alleles models in which in- 
dividuals have a type which is thought of as the contribution of a gene to a given 
quantitative trait. The type belongs to a general state space that represents some- 
thing like the trait value (in which case the state space is a subset of R) and types 
have an associated fitness, which is some fairly arbitrary function from the type 
space to (0, 1]. The type may be regarded as the combined effect of a multilocus 
genotype, but the models do not explicitly incorporate a family of loci, the con- 
figuration of alleles present at those loci, or a function describing the fitness of a 
configuration: everything is cast in terms of how fit each type is and how likely one 
type is to mutate into another. 

The development in [ESW06j leading to the model we consider here was very 
much influenced by the discrete time multilocus models of [TB90, BT91, KJB02 
that incorporate arbitrary forms of selection, general modes of inheritance, and 
other evolutionary forces such as migration and mutation. Also, the model studied 
here and in |ESW06| is essentially the result of adding strong recombination to 
the model of |SEW05j . which may itself be thought of as a generalization of the 
infinitesimally rare alleles model of |KM66j (see also, |Kon82j ) as it is cast in 
|Daw99| . 

Certain classes of mutation-selection models without recombination are solved 
explicitly in .WBG98, BW01 a using ideas from statistical mechanics. Such models 
may be thought of as either multilocus systems with complete linkage or structured 
single locus systems. Finally, we have already mentioned the constellation of papers 
BB03, Baa05, Baa07 ( BaaOl presenting a deterministic model of population change 
due to recombination alone. 



2. The Model 

In this section, we review the details of the model in f |ESW06] ) and derive the 
relevant equations in the special case of polynomial selection costs, to be defined 
below. 

Let A4, G, v, and S be as above. That is, M is the complete, separable metric 
space of loci, G is the space of integer-valued finite measures on M representing 
possible genotypes, v is the finite measure on Ai describing the rates at which 
mutation events occur in different parts of the genome, and S : G — > R+ is the 
selection cost function. As above, we assume that S(0) = and S(g + h) > S(g) 
for g,h £ G- 
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For any finite measure tt on A4, let J" denote a Poisson random measure on 
M. with intensity 7r (so that X™ is a random variable with values in Q). Define a 
function F v : M — > R+ by 

(2.1) ^(m)=E[5(X 7r +J ro )-5(X 7r )] 

That is, Fnirn) measures the average change in selection cost when mutation at 
locus m € M. is added to the random genotype I". 

Let Dn be the measure on M., absolutely continuous with respect to 7T, with 
density (that is, Radon- Nikodym derivative) F w : 

(2.2) &l {m) = F ^ m) . 

air 



In this notation, equation (|1.2[) becomes 



(2.3) p t = p a + tv - I Dp s ds, 

Jo 

where, as above, pt is the finite measure on M. giving the intensity measure of 
the Poisson random measure with distribution P t . We may write (|2.3[) somewhat 
informally as 

r* 

Pt(dra) = po(dm) + tv(dm) — / F Ps (m)p s (dm) ds. 

Jo 

It was shown in [ES W06] that (gSD has a unique solution for any initial condition 
Po provided that the selection cost S satisfies a Lipschitz condition with respect to 
the Wasserstein metric on Q. We take a slightly different perspective here. Let 
L°°(A4,v) denote the usual Banach space of (equivalence classes of) z^-essentially 
bounded functions on M, and write L^(A4, v) for the subset of L°°(A4, v) consist- 
ing of non-negative functions. There is, of course, a bijection between L+(Ai,v) 
and the space K, of finite measures on M that are absolutely continuous with re- 
spect to v with ^-essentially bounded Radon-Nikodym derivative, and so we can 
metrize K, using the L°°(M, v) metric on L°£{M, v). An argument essentially the 
same as that in [ESW06] establishes the following. 

Lemma 2.1. Suppose that S is such that F^ £ L c £(M,v) for it 6 K, and there 
exists a function H : R + — > R + for which 

d(Dn') d(Dn") 



dv dv 



\F n ,<j>' - iW'||co < HiWWoo V \\<t>"U)W 



when 7r',7r" 6 /C with it' {dm) = 4>'{m)v(dm) and 7r"(dm) = cj>" (m) v(dm) . Then 
A2.3\) has a unique K,-valued solution for any po G /C. 



From now on we will assume that the selection cost <S* is polynomial, by which 
we mean that 

N 

(2.4) S(g) = J2 a n (m)g® n (dm) 

for some positive integer N , where for each n the Borel function a n : M. n — > R+ 
is permutation-invariant (that is, a„(7rm) = a„(m) for all permutations 7r) and 
has the property that a n (m) = if there exist i ^ j with rrii = rtij. Further- 
more, we assume that each function a n is bounded. The number n\ a n {m\, . . . , m n ) 
represents the interactive effect of possessing mutations at the n loci mi,... ,m n 
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over and above that of possessing any subset of them, and this additional effect is 
independent of the order in which the mutations are written. 

Remark 2.2. When M. is the finite set {1, ...,N} and, as above, we encode a geno- 
type g £ Q as an ordered N— tuple of non-negative integers g = (g\, . . . ,<7iv), 
where represents the number of times an ancestral mutation has occurred at 
locus k, then the expression for S(g) in (|2.4|) coincides with that in (|1.3|) if we set 
cti = n\ a„(m) when / is the subset {mi, . . . , m„} C {1, . . . , N} = M. and m is the 
vector (mi, . . . , m„). 



We will now derive the particular form that the equation (12. 3|) takes for poly- 
nomial selections costs. Fix n € K, and, as above, let I" be a Poisson random 
measure with intensity tt. Then, for each m e M., we have 

N 



S{X" + S m ) = V / a n (m)(X« + 5 m )® n (dm.) 

= jbj Mn a «( m ) (e (*) (**)** ® ^ (n_fe) ) ( rfm )- 

Note that the symmetry of a„ allows us to rearrange the order of variables in the 
integral. Thus, 

5(^ + «5 m ) = EE j / «n(m)(r)*®e" fc) W 



= V / a n (m)(A-' r )® n (dm) + n / a n (m) (X^)^™" 1 ) ® <J m (dm) 

„_1 UM" J M n 

because all other integrals are by the assumptions on a n . Hence, 

N 

(2.5) S(X* + 8 m ) = S(X*) + Vn / a»(m, m) (X n )^ n -^(dm), 

where we write a n (m, m) := a„(mi, . . . , m n _i, m) for m = (mi,...,ro B -i) G 
7W (n_1 ^ and m £ M, and by convention, J M0 oi(m,m) (l") 8 '"^^) = ai(m). 
Therefore, 



(2.6) F 7I (m)=E[S(X 1T + S m )-S(X lr )} = y"n [ a„(m, m) /^'((im), 

by the independent increments property of Poisson random measures. 

Remark 2.3. Suppose that A4 is the finite set {1, ...,N} and, as above, we encode 
a genotype g £ Q as an ordered N— tuple of non-negative integers g = [g\, . . . , gN) 
and set ai = n! a„(m) when / is the subset {mi, . . . , m„} C {1, . . . , N} = Ai and 
m is the vector (mi, . . . , m n ). Then 

F„(k) = ai^ 1 , keM, 

where we identify the measure it on M. with the vector (tti, . . . ,7Tjv) and we use the 
notations of the Introduction that tt 1 = Yiiei n i an d T-k is the collection of subsets 
of M. that contain k. Consequently, equation (|2.3[) becomes equation (|1.4[) in this 
special case. 
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Returning to the general case, it is straightforward to check that the condition 
of Lemma |2~T1 holds for our choice of the cost function S, and so (|2.3|) has a unique 
K- valued solution for any initial condition po £ fC. We will henceforth suppose that 
Pa G fC. 

Write (f>t for a choice of the Radon-Nikodym derivative of p t with respect to v. 
It will be convenient to turn our measure-valued integral equation (|2.3jl for t 1— > p t 
into a function-valued differential equation for the corresponding Radon-Nikodym 
derivatives with respect to v as follows. 

For x G L°?(M, u), define Gx £ L5°(.M, i/) by 



Gx{m) :- 



' AT 

E 

.n=l 



ii I a„(m,m)a;(mi)---x(m„-i)j' 8 '" 1 ^(c?m) 

A4(»-i> 



x(m). 



Lemma 2.4. Suppose that p t is the solution of 12.3]) . Then there is a non-negative 
Borel function (t,m) <— > x t (m) on R_|_ x .M smc/i i/iai i/je function m <— > Xt(m) is 
a Radon-Nikodym derivative of pt with respect to v for all t > 0, and for v-a.e. 
m £ A4 the function t <—> xt (m) is differentiate with 



(2.7) 



x t (m) = 1 - Gx t (m), 



Proof. For each t > 0, let <f>t be a Radon-Nikodym derivative of p t with respect 
to v. We may write B(M), the Borel cr-field on M, as = c(U fe J^), where 

JFi C JF 2 C . . . is a sequence of finitely generated sub-cr-fields. It is therefore clear 
from a concrete description of the Radon-Nikodym derivative in terms of a limit 
of JF/j-measurable ratios (see, for example, Section III-l of [Nev75j) that we may 
suppose that the map (t,m) — ► 4>t(m) is Borel measurable. 
By Fubini's theorem, 

t 



4>t(jn) v(dm) 



4>o(m) v(dm) +t / v(dm) 
A J A 



F Ps (m)4> s (m) ds 



is (dm) 



for any Borel set iCAi. Thus, for each t > we have for v-a..e. m £ M. that 



(2.8) 



<t>t(rn) = <M TO ) + t 



F Ps (m)cj) s (m) ds =: ip t (m), 



by the uniqueness of the Radon-Nikodym derivative. A fortiori, (|2.8j) holds for 
A® f-a.e. pair (t, m) 6 R + x (recall that A is Lebesgue measure on R+). Thus, 
by Fubini's theorem, it follows that, for v-a,.e. m £ Ai, the equation (12. 8j) holds 
for A-a.e. t > 0. Also, p t is the measure -0 t (c?rn) := ipt( m ) v(dm) for all £ > 0, and 
hence (m) = F^ (m) for all t > 0. Thus, for i^-a.e. m S and for all t > 0, 

/ Fx (m)ips(m) ds = / F Ps (m)(f> s (m) ds. 
Jo s Jo 

By definition, ipo = 4>q, and so, for u-a.e. m £ M and for all t > 0, 

V>t(m) = tp (m)+t- Fx (m)ip s (m) ds. 
Jo 

Therefore, for v-a.e. m £ M. the function 1 i— > tp t (m) is differentiable with 

^t(m) = 1 - F^ t (m)ip t (m) = 1 - Gip t (m), 
and setting xt = V^t gives the result. □ 
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3. Equilibria and Stability 



We can now state our main theorem. 

Theorem 3.1. Suppose that E := inf{ai(m) : m G A4} > 0. Then the system 
\2. 7| ) has a unique equilibrium point in the space L c £(M.,v). Furthermore, this 
equilibrium is globally stable, in the sense that all solutions to \2. 7| ) converge to it 
in the L°°(Ai, v) norm as t goes to infinity. 

The proof will be accomplished in four parts. First, we derive some upper and 
lower bounds on solutions to (|2.7p . Then, we establish existence of at least one 
equilibrium point by a fixed point argument and show uniqueness using a gener- 
alization of a method of Craciun and Feinberg 1CF05] , Finally, we obtain global 
stability of the equilibrium via the construction of a Lyapunov function. Since we 
are dealing with an infinite dimensional dynamical system, all of these steps rely on 
some technical lemmas concerning the continuity of certain operators on L°^(A4, v), 
and we defer these to Section 0] 



3.1. Estimates on the solution. Let Xt be a solution to the equation (12. 7| . 
Since Gxt(m) > a\{rn)xt{rn), we see for u-a.e. m G M. and all i > that Xt(m) < 
1 — ai (m)xt(m). Hence, 

1 - (1 - a 1 (m)x (m))e~' a ^ m '> t 



(3.1) 



x t {m) < 



ai(m) 

for u-a.e. m G M. and all t > 0. As t — * oo the right-hand side of (|3.ip either 
increases to the value l/ai(m), in the case that xo(m) < l/ai(m), or otherwise 
decreases to the same value. In either case, we have for v— a.e. m £ M that 
xt{m) < max{a:o(m), l/ai(m)} for all t > 0. Thus, 

(3.2) sup ]]a; t ||oo < K, 

t 

where K = ||x ||oo V A with A := E^ 1 . 

Similarly, since for all t > we have Xt(m) < K for ^-a.e. m £ M, it follows 
that 

N 



(3.3) Gx t (m) < 



E 



n I a n (m,m)K (n - L >iy QS(n - 1 >{dm) 



for some constant C . Therefore, 

\-{l-Cx Q {m))e- ct 



x t (m) < Cxtim), 



x t {m) > 



> 



1 - e- Cf 



C 



C 

+ x (m)e 



-ct 



> 



1 



,-Ct 



C 



Hence, 
(3.4) 



lim inf ess inf xt (m) > 1/C > 0. 



3.2. Existence of equilibria. Observe that x G LT(M, v) is an equilibrium point 
of (|2.7|) if and only if Gx(m) — 1 for t/-a.e. m G A4, that is, if and only if a; is a 
fixed point of the map T : L+(M,v) — > (A4, v) given by 

-l 



(3.5) Ty(m) 



N 



a„(m, m)y(m 1 ) ■ ■ ■ y(m n -i) v® {n 1} (dm) 
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Note for any y G L+(M, v) that 
N f 

i(m)<V"n/ o n (m,m)y(mi) • • -y(m n _i) ^® ( ™ 1:1 (dm). 



JV 
n=l 

Hence, 



(3.6) Ty(m) < A, for !/-a.e. to g A4. 

Thus r does indeed map (A4,^) into itself. Moreover, if y(m) < A for u-a.e. 
to G A4, then 

JV 



V" a n (m,m)y(mi)---t/(m n _i)z/® ( " ^(dm) 

Vn / a„(m,TO)A ( "- 1) i/® (,l - 1) (dm) =: B' 1 , 



n=l 

N 

< 

n=l 



so that 

(3.7) B < Ty(m), for i/-a.e. to e 7W. 
Therefore, T maps the convex set 

(3.8) := {y G L+ (A*, v) : B < y(m) < A) 

into itself. Recall that the Banach space L°°(AA, v) is the dual of the Banach space 
l}(M.,v) under the pairing (x,f) := j M x(rn)f(rn)v(dm) for x G L°°(M.,v) and 
/ G L l (M,v). Recall also that the weak* topology on L°°(A4,i/) is the weakest 
topology that makes each of the maps x 1— ► (x, /), / G i 1 (A^, ^), continuous. A con- 
sequence of the Banach- Alaoglu Theorem is that that any weak*-closed and norm- 
bounded subset of L°°(M.,v) is weak*-compact (see Corollary 3 of Section V.4.2 
of |DS88j ). and so the set R is weak*-compact. The map T is weak*-continuous 
on the convex weak*-compact set R by Lemma 14.21 below . An infinite-dimensional 
extension of the Brouwer Fixed Point Theorem, the Schauder-Tychonoff Theorem 
(see Theorem 5 of Section V.10.5 in |DS88| ). guarantees that T has a fixed point in 
R. Consequently, the system (|2.7[) has at least one equilibrium point. 

Remark 3.2. Note from the argument above that if x G L^(A4, v) is any equilibrium 
point of (|2.7p . then x(m) — Tx(m) < A for z^-a.e. to G M by (|3.6[) . and so x G R 
by (j3~71l . 

3.3. Uniqueness of the equilibrium. We claim that there is only one solution in 
L+(Ad, v) to the equilibrium equation Gx{m) = 1, z^-a.e. to G M. Our argument 
follows that used in Theorem 3.1 of ICF05] to establish a criterion for the uniqueness 
of equilibria in the finite dimensional mass-action kinetics systems of differential 
equations that arise in the study of continuous flow stirred tank reactors. We refer 
the reader to [CF05] for a discussion of the history of related results. A recent paper 
on the use of algebraic geometry in analyzing instances of such systems is [CDSS07 , 
which also contains several references to other applications of this general class of 
dynamical systems. 

By Remark l3.2[ it suffices to show that there is only one solution in the set R. 
Assume otherwise, that is, that there are two solutions x G R and y G R with 
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x ^ y. In particular, we have that Gx(m) — Gy(m) for v-&.e. m £ M., and so 
N f 

N f 



n=l 



ra=l 

Therefore, 

iV 



/ n a n (in,m)x(mi) ■ ■ ■ x(m n —i)x(rn) 

\x{mi) ■ ■ ■ x{m n -i)x(m) J 



n=l 
X 



for i^-a.e. m 6 M. Setting K n (m, m) := a n (m, m)j;(mi) • • • i(m n _i)i(m) and 
S(m) := log (y(m)/x(m)), we obtain 

v „ 

Vn #„(m, m) f e «(mi)+-+*K-i}+«(m) _ j\ ^(n-i) ( dm ) = q 

Observe that <5(m) is bounded since x and y are in R. Thus, putting 

e <5(mi)H h(5(m„_i)+(5(m) _ 



f?„(m, m) := K n (m, m)- 



S(mi) H h <5(m„_i) + 5(m) 1 

we get 

v 

(3.9) Vn r/„(m,m)(5(TOi) + --- + 5(TO„_ 1 ) + (5(m))^™- 1 )(dm) =0 

„=i JMO-v 

for j/-a.e. m S M.. Note that the function rj n is non-negative, since $(mi) + • • ■ + 
S{m n -i) + 6(m) and e <5(mi)+-+<5(m»-i)+<5(m) _ x haye the same gign _ Alg0; ^ ig 

permutation invariant and takes the value whenever two of the coordinates of m 
are equal. 

Integrating the left-hand side of (|3.9p against the function 8 gives 
= / ?y„(m, m) 

x (<5(mi) H h S(m n -i) + S(m)) ^ n ~ 1 \dm)5(m) v(dm) 

N 

= ^2 n rj n (mi,...,m n )(8{mi)-\ h 5{m n -i) + <5(m„)) 



„=i -M" 
x (5(m„)z/® n (dmi, . . . , dm„) 

= y^y^ / Vn(mi, . . . ,m n )(S(mi) H h<5(m„)) 

„=1 fe=l 

x 5(m k ) v® n (dmi, . . . , dm n ), 
by the symmetry of ry„ together with the change of variables 

(mi, . . . ,m fe , . . . ,m n ) <-> (mi,...,m„,...,m fe ) 
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once for each k. 
Therefore, 



= ^ / Vnim!, . . . , 
n=l Jm " 



m n ) [6 (mi) + ■ • • + S(m n )] 2 v® n (dmi, . . . , dm n ). 



In particular, = J M rii(m)5(m) 2 v(dm). Since ai(m) > E > for all m G A4, it 
follows that rji(m) > for all m G .M , and hence S(m) = for ^-a.e. m G M.. This 
contradicts our assumption that x ^ y, and so the equilibrium point is unique. 

3.4. Stability. Let x t be a solution of flUT]). Write a;* G R C L!jf(.M,i/) for the 
unique equilibrium point guaranteed by Subsection 13 . 2 1 and Subsection l3.3l We will 
show that linif^oo \\x t — x*||oo = using a variant of Lyapunov's second method. 

A recent review of Lyapunov function methods for studying the asymptotic be- 
havior of finite dimensional nonlinear systems is [LR04J . Much of the finite dimen- 
sional theory considers Lyapunov functions that are continuously differentiable. 
General stability results for finite dimensional systems that do not assume this de- 
gree of smoothness are discussed in |CLH99j , where there is a number of references 
indicating why weaker assumptions are natural in several applications. Infinite di- 
mensional systems are considered in |Daf 78 , primarily in the context of partial 
differential equations. Discontinuous Lyapunov functional appear there because 
it is natural to work with a weak* topology in order to make norm-bounded sets 
compact, but functionals that are continuous in the norm topology may cease to be 
continuous in the weak* topology. This is exactly the situation that confronts us. 
Because we haven't found a result in the literature that applies directly to estab- 
lish global stability in our setting, we provide the details of the relatively routine 
argument. 

Define the function V : L°£(M,v) -^tU {+00} by 



Observe that V is, indeed, R U {+oo}-valued and even bounded below, because 



(3.10) 





(3.11) 




> (1 + log(.E)) x u(M) > -00, 



since au — log(w) > 1 + log(a) for all u > 0. 
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Set V t := V(x(t)). We have 



4v« = Hm 1 



log(x t+fc (m)) i/(dm) 



A4 



log(xt(m)) v(dm) 



M 



(3.12) 



lira — 

/t->o h 



ai(m)x t +h(m) v(dm) 



M 



a\{m)xt{m) v(dm) 



M 



1 N 
lim - y 

n=2 



a„(m) 



.VI" 



x [af t+ /i(mi) • ■ ■ x t+ /,(m n ) - x t (mi) ■ ■ ■ x t (m n )] v® n {dm). 
From (|3~2")) and (|3T4"|) there exist constants C > 0, X > 0, and T > such that 



(3.13) 



— — < essinf x t (m) < esssupx t (m) < K, t >T. 

26 "i m 



Hence, by the Dominated Convergence Theorem, we may interchange the limit and 
integrals in (|3.12[) to get 

x t (m) 



v(dm) 



ai(m)xt{m) v(dm) 



E n 



n=2 



(3.14) 



.VI x t( m ) J M 

a n {va)x t (mi) ■ ■ ■ xt{m n -\)x t {m n ) v® n (dva) 
1 



M" 



-x t (m)) 



M 

N 

E> 

n=2 



ai(m) 



xt(m) 

o n (m)a; t (mi) • -itK-i) ^'"'^(dm) 



i>(dm) 



itim)—— 



x t (m) v{dra) 



< 



for all t > T. 

Define V : L+{M,v) 



(3.15) 



V(x) 



U {-oo} by 

(1 - Gx{m)) 2 



1 



v(dm). 



IM x ( m ) 
From (gmp and $T7$, we obtain = V(x t ) for f > T. 

By Lemma 14.31 the function V is upper semicontinuous in the weak* topology 
on the domain {x s L+(Ad,v) : < essinf m x(m) < esssup m x(m) < K}. Also, 
if x is in this domain, then V{x) — if and only if Gx{m) = 1 for v— a.e. m € M., 
that is, if and only if x = x*. It follows that for any e > the supremum of V on 
the weak* compact set 

{x 6 LT(A / (,^) : — < essinf x(m) < ess sup xirri) < K, \\x — x*^ > e} 
26 m m 

is strictly less than 0. 

Assume that ||a:( — x*||oo does not converge to as t — > oo. Then there exists 
S > and a sequence tk — > oo with > T for all fc such that ||xt fc — a;*||oo > 5 
for all fc. Now, since x t is uniformly bounded as a consequence of (|3.2p . (|3.3|) . and 
(|3.4p . there is a number 7 > independent of k such that ||a; t — aft fc ||oo < <V2 for 
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tk < t < tk + 7, so that \\xt — x*\\oo > 5/2 for tk < t < tk + 7- It follows that there 
is a constant A < independent of k such that 

(3.16) ^-V t = V(x t ) < A < 

at 

for tk < t < tk + 7. Because V* is nondecreasing for i > T by (|3.14p . the inequality 
(|3.1Gp contradicts the conclusion from (|3 . 1 1 1) that inf t V t > — oo. Thus \\x t — £*||oo 
must converge to as t — > oo. 

Remark 3.3. Put v := c -1 ^, where c := 2/(.M), and yt := = cx c -i t . Then, 
yt(jn) = x c -i t (m) = 1 — Gyt{m), where G is defined in the same way as G, except 
that v is replaced by v. Therefore, by a change of time units, we may suppose with- 
out loss of generality that v is a probability measure. In that case, the Lyapunov 
function V may be written as 

V(x) = D(p || x-v)- log ^ x{m) v{dm)^j + E [S(X X - U )} , 

where D(- \\ •) denotes the relative entropy or Kullback-Leibler distance between two 
probability measures, x, :— (J x(m) v(dm))^ 1 x is x renormalized to be a probability 
density with respect to the measure v, x ■ v is the probability measure with density 
x with respect to v, x ■ v is the finite measure with density x with respect to v, and 
I 1 " is a Poisson random measure with intensity x ■ v. Therefore, our system (|2.3j) 
evolves in such a way that it tries to decrease the sum of the expected selective 
cost of an individual chosen at random from the population, E[S'(X' : ' t )], and the 
Kullback-Leibler distance between the probability measures v and pt(A'l) _1 pt, while 
not allowing the expected total number of ancestral mutation events present in the 
of a randomly chosen individual, pt{M) — E[X Pt {M)\, to be too small. 

We note that the sum of a Kullback-Leibler distance and the logarithm of ex- 
pected selective cost appears as a Lyapunov function for the continuous-time, single- 
locus, n-allele, mutation-selection model of }Hof85j in which the mutation rate from 
allele i to allele j only depends on the target allele j. This Lyapunov function is 
used there to prove a global stability result. Also, a Kullback-Leibler distance is a 
Lyapunov function in a neighborhood of the stable state of the frequency-dependent 
selection model of [Bom91 that builds on the evolutionary game theory work of 
Aki82 , where a similar quantity also appears. 

4. Some technical lemmas 

We collect together in this section some results used in the proof of Theorem l3.ll 
The following result is elementary, but we include it for completeness. 

Lemma 4.1. Suppose that f G L 1 (A / l n , v® n ) for some positive integer n. The 
function 



x i * / x(mi)---x{m n )f(mi,...,m n )v® n (dm) 

is weak*- continuous on any norm bounded subset of L c ^{M.,v). 

Proof. This is obvious for functions of the form 

/(mi, . . . ,m„) = Ji(mi) . . . /„(m„), 

where ft, . . . , f n £ L 1 (A / (, v\ and the result for general / follows from the fact that 
finite linear combinations of such functions are dense in L 1 (A / f™, v® n ). □ 
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Recall the definition of the map T : L^(M,f) — ► L+{M,v) and the set R C 
L^(A4, v) from (13. 5|) and ()3.8|) . respectively. Recall also that T maps R into itself. 

Lemma 4.2. The map T : R — > i? is weak* -continuous. 

Proof. Suppose that is a sequence in R such that x k converges to x € R in the 
weak* topology as k — > oo. Fix a test function / G L 1 (A^, f). Then 



Txk{rn)f{rn)v{drn)— / Tx(m)f{rn)v(drn) 



M 



M 



a„(m,m) [xfc(mi) • • •xa : (to„_i) - ar(mi) • • -x(m„_i)] i/®( n-1 )(dm) 



n=2 
X 



x v(dm 



where 



H(m):=[y^n a„(m, m)x k (mi) ... x k {m n -i) v® (n ^(dm) 

x f^n y a„(m)x(m 1 )...a;(TO„_i)^ ( "- 1) (rfm)^ > £ 2 . 



Therefore, 

Txk{m)f{rn)v{drn)— I Tx(m)f(m)i/(dm) 



M 



M 



N 



x v(dm). 
Each integral 



a n (m,m) [^(mi) • • -x fc (m„_i) - ar(mi) • ■ -x{m n -i)] ^® ( ™ ^(dm) 



/ a n (m,m) [x k {mi) ■ ■ ■Xk(m n -i) - x(m{) ■ ■ -x(m„_i)] z^® ( ™ 1) (dm) 

belongs to L°°(A^, ^) with norm that is bounded in k. Moreover, each such integral 
converges to zero as k goes to infinity by Lemma |4. 11 It follows from the Dominated 
Convergence Theorem that 



lim 

k — >oc 



Txk{m)f{m)u{dm)— / Yx(m)f{rn)v{drn) 



M 



M 



= 0, 



as required. 

Recall the definition of the function V : LT(M, v) -> K from (|3~15| . 



□ 



Lemma 4.3. For any constants < a < b < oo, the function V restricted to the 
set 

{x £ L°?{M.,v) : a < essinfx(m) < esssupa;(m) < b} 
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is upper semicontinuous in the weak* topology. 

Proof. Fix a domain D C L+(M, v) of the type considered in the statement of the 
lemma. 

Observe that 

V(x)=- I -j—u{dm)+H{x), 
Jm x ( m ) 

where H : D — > R is a sum of functions, each of the form 

g(mi, . . . ,m k )x(mi) ■ ■ ■ x(m k ) v(dm\) ■ ■ ■ v(dm k ) 

for some function g 6 L oa (M k , v® k ). It therefore suffices by Lemma [4.11 to show 
that the function 

(4.1) x i > / 1 v(dm) 

Jm x ( m ) 

is weak*-upper semicontinuous on D 

Suppose without loss of generality that v is a probability measure. Let H n = 
(A ni , . . . , A n ) be a sequence of partitions of M such that the er-fields gen- 
erated by the successive partitions form a filtration J- n with the property that 
c(U n Fn) = B(M.), the Borel cr-field on M.. Consider ifflasa random variable 
on the probability space (A4, B(M.), v), and let x^- 71 ' :— E[x \ be the condi- 
tional expectation of x given T n . Note that x^ n > is a non- negative martingale with 
a < x^(m) < b for ^-a.e. m 6 M. For m G A nq we have 

x^ n \m) = ——r — r / x(m) v(dm), 
v \A nq ) J 

with the convention that 0/0 = 0. The function taking x to each such integral is 
weak*-continuous. As a result, the function 



v{dm) = - V] v{A riq 



I \ / \ ^\ 7 — / 

;W m z — ' 



i 

x{m) v(dm) 



<A nq ) 



is also weak*-continuous. 

By Jensen's inequality for conditional expectation, the sequence of random vari- 
ables — l/a;W is a non-positive supermartingale, and so the sequence of expectations 

[x (n) (m)] -1 i/(dm) = E[-l/x (n) ] 

is non-increasing. By the Martingale Convergence Theorem (see, for example, The- 
orem IV- 1-2 of |Nev75j ). the sequence x^ n \m) converges to x(m) for i^-a.e. m G M.. 
Therefore, — J M [x^ (m)] _1 v(dm) decreases to — J M x(m) _1 i>{dm) by the Domi- 
nated Convergence Theorem. 

We can thus write the function x i— » — J M x(m) -1 v(dm) as the infimum of the 
family of functions x i— » — j M [x^ (m)] _1 v{dm), each of which is weak*-continuous. 
Consequently, the function x i— > — a;(rn) _1 v(dm) is weak*-upper semicontinu- 
ous. □ 
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